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showed  a  small  but  systematic  tendency  for  the  model  to  underestimate 
the  phase  propagation  of  weather  disturbances.  The  effect  was  not 
considered  to  be  large  or  systematic  enough  to  warrant  empirical 
corrections . 


DETERMINATION  OF  THE  SYSTEMATIC  BIAS  AND 
VARIABILITY  OF  THE  ERROR  OF  THE 
NATIONAL  METEOROLOGICAL  CENTER’S 
SIX-LAYER  BAROCLINIC  MODEL 


by 


JAMES  KEIL  WOESSNER 


A  thesis  submitted  in  partial  fulfillment 
of  the  requirements  for  the  degree  of 
MASTER  OF  SCIENCE 
UNIVERSITY  OF  WASHINGTON 
1980 


Approved  by  _ 

Chairperson  of  Supervisory  Committee 

Program  Authorized 

to  Offer  Degree  _ 


Date 


AFIT  RESEARCH  ASSESSMENT 

The  purpose  of  this  questionnaire  is  to  ascertain  the  value  and/or 
contribution  of  research  accomplished  by  students  or  faculty  of  the 
Air  Force  Institute  of  Technology  (ATC).  It  would  be  greatly  appreciated 
if  you  would  complete  the  following  questionnaire  and  return  it  to: 

AFIT/NR 

Wright-Patterson  AFB  OH  45433 

Research  Title:  Determination  of  the  Systematic  Bias  and  Variability 
of  the  Error  of  TFT5  N s 1 16 h a  1  Mutuuruluyi cal  Center's  S iT^tuyH  r 
Baroclinic  Model _ 

Author:  James  Keil  Woessner 

Research  Assessment  Questions: 

1.  Did  this  research  contribute  to  a  current  Air  Force  project? 


2.  Do  you  believe  this  research  topic  is  significant  enough  that  it  would 
have  been  researched  (or  contracted)  by  your  organization  or  another  agency 
if  AFIT  had  not? 


3.  The  benefits  of  AFIT  research  can  often  be  expressed  by  the  equivalent 
value  that  your  agency  achieved/received  by  virtue  of  AFIT  performing  the 
research.  Can  you  estimate  what  this  research  would  have  cost  if  it  had 
been  accomplished  under  contract  or  if  it  had  been  done  in-house  in  terms 
of  manpower  and/or  dollars? 

a.  Nan-years  _ .  b.  $ _ 

4.  Often  it  is  not  possible  to  attach  equivalent  dollar  values  to  research, 
although  the  results  of  the  research  may,  in  fact,  be  important.  Whether  or 
not  you  were  able  to  establish  an  equivalent  value  for  this  research  (3  above), 
what  is  your  estimate  of  its  significance? 


2.  Highly 

Significant 


b.  Significant  c.  Slightly 

Significant 


d.  Of  No 

Significance 


5.  AFIT  welcomes  any  further  comments  you  may  have  on  the  above  questions, 
or  any  additional  details  concerning  the  current  application,  future  potential, 
or  other  value  of  this  research.  Please  use  the  bach  of  this  questionnaire 
for  your  stateaent(g) , 


GRADE 


ORGANIZATION 


POSITION 


LOCATION 


USAF  SCN  75-20R 


University  of  Washington 
Abstract 


/» 


DETERMINATION  OF  THE  SYSTEMATIC  BIAS  AND 
VARIABILITY  OF  THE  ERROR  OF  THE 
NATIONAL  METEOROLOGICAL  CENTER'S 
SIX-LAYER  BAROCLINIC  MODEL 

By  James  Keil  Woessner 

Chairperson  of  the  Supervisory  Committee: 

Professor  John  M.  Wallace 
Department  of  Atmospheric  Sciences 

The  geographical  distribution  of  the  bias  and  variability  of 
the  error  in  the  500  and  1000  mb  geopotential  height  forecasts  pro¬ 
duced  by  the  National  Meteorological  Center's  operational  six-layer 
baroclinic  model  was  investigated.  Forecasts  and  the  verifying  ob¬ 
servations  for  the  six  winter  seasons  beginning  with  the  1973-74 
winter  were  used.  It  was  found  that  the  model  systematically  damps 
out  the  amplitude  of  the  climatological  mean  surface  highs  and 
lows,  and  500  mb  troughs  and  ridges.  Typical  values  of  the  bias 
in  the  mean  500  mb  standing  wave  troughs  are  on  the  order  of  60 
meters:  in  these  regions  the  bias  accounts  for  up  to  30%  of  the 
root  mean  square  error.  Throughout  the  forecast  period,  the  stan¬ 
dard  deviation  of  the  forecast  error  about  its  own  mean  shows  re¬ 
gional  contrasts  similar  to  the  climatological  standard  deviation 
patterns.  These  non-systematic  forecast  errors  tend  to  be  largest 
in  the  "storm  tracks"  over  the  western  oceans  and  quite  small  at 
low  latitudes.  Normalization  of  the  standard  deviation  of  the 
error  by  the  climatological  standard  deviation  of  the  observation 


results  In  a  relatively  featureless  field  except  for  a  monotonic 
pole-to-equator  increase.  The  normalized  errors  are  larger  at  the 
1000  mb  level  than  at  the  500  mb  level  over  most  of  the  hemisphere 
but  they  are  of  comparable  magnitude  over  the  eastern  Atlantic  and 
Europe . 

The  vertical  structure  of  the  error  field,  as  manifested  in 
the  geographical  distributions  of  the  correlation  between  the  1000 
and  500  mb  forecast  error,  and  the  ratio  of  the  amplitudes  of  error 
fluctuations  at  those  levels,  was  found  to  be  similar  to  that  of 
real  atmospheric  fluctuations. 

An  investigation  of  phase  errors  based  on  cross  spectral  anal¬ 
ysis  between  time  series  of  72-hour  forecasts  and  concurrent  obser¬ 
vations  showed  a  small  but  systematic  tendency  for  the  model  to 
underestimate  the  phase  propagation  of  weather  disturbances.  The 
effect  was  not  considered  to  be  large  or  systematic  enough  to 


warrant  empirical  corrections 


TABLE  OF  CONTENTS 


Page 

LIST  OF  TABLES  ill 

LIST  OF  FIGURES  iv 

ACKNOWLEDGMENTS  vil 

1.  INTRODUCTION  1 

2.  DATA  DESCRIPTION  6 

3.  PRELIMINARY  COMPARISON  OF  FORECAST  TO  CLIMATOLOGY  6 

4.  STATISTICAL  METHODS  12 

5.  INVESTIGATION  OF  EFFECTS  OF  DIFFERENT  ANALYSIS 

PROCEDURES  18 

6.  ERROR  STATISTICS  19 

a.  Systematic  Errors  19 

b.  Non-Systematic  Errors  29 

c.  Vertical  Structure  of  the  Error  40 

d.  Systematic  Phase  Errors  46 

7.  DISCUSSION  51 

8.  REFERENCES  53 


ii 


-»v . 


S' 


LIST  OF  TABLES 


TABLE 

PAGE 

1 

Cross  spectral  analysis  between  forecast  and 
observation  time  series  at  selected  grid  points 

49 

ill 


LIST  OF  FIGURES 


FIGURE 

1 

2 

3 

4 

5 

6 

7 

8 

9 

10a 

10b 


PAGE 

Yearly  mean  S^  scores  for  NMC's  36-hour  sea-level 
prognoses  from  the  PE,  model,  30-hour  500  mb  prog¬ 
noses  from  the  three-layer  baroclinic  model  1962- 
1966  and  the  six- layer  PE  model  1966-1975,  and 
36-hour  500  mb  manual  prognoses,  (taken  from  Faw¬ 
cett,  1977)  3 

Mean  montly  observed  500  mb  contours  in  decameters 
(solid  lines)  with  mean  observed  jet  stream  core 
positions  (heavy  arrows)  and  mean  error  patterns  for 


(a)  36-hour  three  level  baroclinic,  October  1965 

and  (b)  36-hour  PE  model,  October  1968.  4 

Mean  observed  geopotential  height  at  (a)  500  mb 
and  (b)  1000  mb  based  on  six  winter  seasons;  con¬ 
tour  interval  for  (a)  60  m  and  for  (b)  20  m.  8 

Standard  deviation  of  the  observed  geopotential 

height  for  (a)  500  mb  and  (b)  1000  mb  based  on 

daily  data  for  winter;  contour  interval  10  m.  9 

As  in  Fig.  3  except  for  72-hour  PE  model  fore¬ 
cast,  10 

As  in  Fig.  4  except  for  72-hour  PE  model  fore¬ 
cast.  11 

Schematic  relationship  between  the  forecast, 
observation,  and  forecast  error  statistics  for 
a  particular  grid  point.  14 

Qualitative  comparison  of  "perfect  model"  fore¬ 
cast  skill  with  that  of  persistence,  climatology, 
and  present  models,  as  measured  by  the  r.m.s, 

500  mb  height  error  (Leith,  1976).  16 


Mean  error  for  the  500  mb  72-hour  forecast  for 

the  period  in  which  the  (a)  Flattery-Hough  and 

(b)  Optimum  Interpolation  objective  analysis 

schemes  were  operational;  contour  interval  10  m.  20 

Mean  error  for  the  500  mb  24-hour  forecast;  con¬ 
tour  interval  10  m.  21 

As  in  Fig.  10a  except  for  48-hour  forecast,  22 

iv 


i 


FIGURE 

10c 

11a 

lib 

11c 

12 

13 

14a 

14b 

14c 

15a 

15b 

15c 

16 

17 

18 

19 


PAGE 

As  in  Fig.  10a  except  for  72-hour  forecast.  Dots 
indicate  grid  points  at  which  cross  spectrum  anal¬ 


ysis  was  performed.  23 

Mean  error  for  the  1000  mb  24-hour  forecast;  con¬ 
tour  interval  10  m.  25 

As  in  Fig.  11a  except  for  48-hour  forecast.  26 

As  in  Fig.  11a  except  for  72-hour  forecast.  27 

Mean  500-1000  mb  thickness  error  for  the  72-hour 
forecast;  contour  interval  10  m.  28 


Ratio  of  the  standard  deviation  of  the  500  mb  72- 
hour  forecast  to  the  500  mb  climatological  stan¬ 
dard  deviation  of  the  observation;  contour  inter¬ 


val  0.1.  30 

Standard  deviation  of  the  500  mb  24-hour  forecast 
error;  contour  interval  10  m.  32 

As  in  Fig.  14a  except  for  48-hour  forecast.  33 

As  in  Fig.  14a  except  for  72-hour  forecast.  34 

Standard  deviation  of  the  1000  mb  24-hour  fore¬ 
cast  error;  contour  interval  10  m.  35 

As  in  Fig.  15a  except  for  48-hour  forecast.  36 

As  in  Fig.  15a  except  for  72-hour  forecast.  37 

Normalized  standard  deviation  of  the  error  for 
the  (a)  500  mb  and  (b)  1000  mb  72-hour  forecasts; 
contour  interval  0.1.  38 

Correlation  coefficients  between  the  forecast 
and  observation  for  the  500  mb  72-hour  forecast; 
contour  interval  0.1.  39 

Area-averaged  r.m.s.  error,  standard  deviation 

of  the  error,  and  the  mean  error  for  (a)  500  mb 

and  (b)  1000  mb  forecasts.  41 

Percent  of  total  variance  of  the  error  due  to 
the  bias  for  the  500  mb  72-hour  forecast;  con¬ 
tour  interval  0.1.  42 


v 


FIGURE 

PAGE 

20 

Ratio  of  the  500  to  1000  mb  standard  deviation 
of  the  error  for  the  72-hour  forecast;  contour 
interval  0.2. 

43 

21 

Correlation  coefficients  between  the  500  and 

1000  mb  mean  errors  for  the  (a)  72-hour  and 
(b)  24-hour  forecast;  contour  interval  0.1, 

45 

22 

Ratio  of  the  500  to  1000  mb  normalized  standard 
deviation  of  the  error  for  the  72-hour  forecast; 
contour  interval  0.1. 

47 

vi 


ACKNOWLEDGMENTS 


I  wish  to  thank  John  M.  Wallace  for  suggesting  this  topic  and 
for  his  guidance  and  critical  insight  throughout  the  course  of 
this  project.  He  has  made  my  experience  at  the  University  of 
Washington  both  challenging  and  personally  rewarding.  I  wish  to 
express  my  appreciation  to  Harold  J.  Edmon  and  Ernest  E.  Recker  for 
their  assistance  with  computer  graphics  and  programming.  Thanks 
are  also  due  to  Lisa  Link  who  typed  the  manuscript  in  a  professional 
manner . 

The  data  set  used  in  the  study  was  obtained  from  NCAR  with  the 
assistance  of  Dr.  Dennis  Joseph.  The  computer  funds  for  this  work 
were  provided  by  the  Climate  Dynamics  Program  in  the  Atmospheric 
Sciences  Division  of  the  National  Science  Foundation  under  Grant 
ATM78-07369 . 


vii 


I 

1 


1.  Introduction 

Since  1954,  operational  numerical  weather  prediction  (NWP) 
has  grown  in  sophistication  and  accuracy  which  has  paralled  the 
growth  and  development  of  computer  technology.  The  first  opera¬ 
tionally  useful  NWP  forecasts  were  issued  in  the  late  1950's  from 
the  early  beginnnings  of  the  National  Meteorological  Center  (NMC) . 
The  first  numerical  forecasts  were  based  on  a  barotropic  model  co¬ 
vering  most  to  the  Northern  Hemisphere  (Cressman,  1958).  In  1962, 
a  three-level  baroclinic  model  (Cressman,  1963)  based  on  the  primi¬ 
tive  equations  was  put  into  operational  use.  Since  1966,  a  hemi¬ 
spheric  six- layer  baroclinic  model  (Shuman  and  Hovermale,  1968) 
was  developed.  A  seventh  layer  was  added  in  1976.  That  model, 
together  with  the  barotropic  model  and  a  regional  Limited-Area 
Fine  Mesh  model  (Howcraft,  1971)  which  was  added  in  1971,  make 
up  the  NWP  guidance  used  today.  Operational  forecasts  have  nor¬ 
mally  been  issued  out  to  48  or  72  hours  and  as  long  as  84  hours 
in  the  case  of  the  six-layer  baroclinic  model.  Over  the  past 
25  years,  improvement  has  been  substantial. 

Skill  of  NWP  model  forecasts  has  been  measured  in  many  ways, 
but  the  earliest  and  still  on-going  measurement  of  skill  is  NMC’s 
SI  score  (Teweles  and  Wobus,  1954).  This  score  is  a  measure  of 
the  error  in  the  pressure  gradient  between  standard  grid  points;  a 
lower  score  is  indicative  of  greater  skill.  The  reader  can  ob¬ 
tain  an  appreciation  of  how  much  improvement  has  been  made  since 
the  early  stages  of  NWP  from  an  inspection  of  the  SI  score  shown 


in  Fig.  1  (from  Fawcett,  1977). 

Improvement  of  NWP  model  forecasts  can  be  attributed  to  a 
number  of  reasons.  The  most  obvious  of  these  are:  a  better 

understanding  of  the  numerics,  the  basic  dynamics,  and  the  physics 
included  in  all  models;  the  development  of  faster,  more  sophis¬ 
ticated  computers  allowing  a  greater  space  and  time  resolution; 
the  ability  to  more  accurately  specify  the  initial  state  of  the 
atmosphere  through  development  of  objective  analysis  routines 
capable  of  including  both  conventional  and  new  types  of  meteoro¬ 
logical  information;  and  through  detection  of  model  deficiencies 
found  by  verification  programs. 

The  primary  purpose  of  any  model  verfication  scheme  is  to 
document  and  evaluate  the  model  performance  characteristics  with 
hopes  of  better  understanding  the  strengths  and  weaknesses  of  phy¬ 
sical  and  dynamical  treatments  incorporated  into  the  models.  Faw¬ 
cett  (1969)  undertook  a  study  to  determine  the  systematic  errors 
in  the  36  hour  PE  at  the  500  mb  level.  One  result  of  Fawcett's 
study  was  to  correct  the  negative  bias  over  the  Canadian  Rocky 
Mountains.  This  error  was  diagnosed  as  a  result  of  overly  smoothed 
mountains  in  Western  North  America.  After  higher  mountains  were 
introduced  into  the  PE  in  September,  1968  (Weather  Analysis  and 
Prediction  Division,  1969),  the  negative  bias  was  reduced  signi¬ 
ficantly.  Shown  in  Fig.  2  [from  Fawcett  (1969)  Figs.  5b  and  8c] 
are  the  mean  daily  error  patterns  for  the  months  of  October,  1965 
and  October,  1968.  In  another  study  of  the  PE,  Leary  (1970) 


Mean  monthly  observed  500mb  contours  in  decameters  (solid 
lines)  with  mean  observed  jet  stream  core  positions (heavy 
arrows)  and  mean  error  patterns  for  (a)  36-hour  three  level 
barocllnic,  October,  1965  and  (b)  36-hour  PE  model,  October 
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documented  the  systematic  errors  at  the  1000  mb  level  by  compositing 
forecast  errors  of  the  central  sea  level  pressure  of  cyclones  by 
storm  type.  In  this  way,  she  was  able  to  document  characteristics 
of  model  simulations  of  storms  in  the  developing,  mature  and  de¬ 
caying  stages.  Pertinent  conclusions  from  her  study  were:  the 
depth  of  oceanic  cyclones  is  generally  underforecast;  the  depth  of 
cyclones  forming  on  the  lee  slopes  of  the  Rocky  Mountains  is 
generally  overforecast;  most  forecasts  of  thickness  over  land  are 
too  warm;  and  many  forecasts  of  thickness  over  the  oceans  are  too 
cold.  No  attempt  has  been  made  to  correct  these  biases  found  by 
Leary . 

The  purpose  of  this  thesis  is  to  undertake  a  study  to  deter¬ 
mine  the  systematic  bias  in  the  winter  season  geopotential  height 
field  of  NMC's  six- layer  primitive  equation  (PE)  model.  Although 
other  researchers  have  documented  the  systematic  bias  in  the  PE 
model,  none  have  computed  the  bias  at  all  gird  points  for  a  long 
term.  Patterns  of  geographical  bias  found  over  the  hemisphere 
and  related  to  characteristics  of  the  mean  state  of  the  atmosphere. 
It  should  be  noted  that  this  study  covers  the  period  after  the  PE 
was  improved  on  the  basis  of  Fawcett's  results.  In  addition  to 
the  determination  of  the  systematic  bias  in  the  PE,  several  other 
statistics  were  computed  in  order  to  gain  more  Information  about 
the  local  distribution  of  forecast  error. 
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2.  Data  Description 

The  observational  data  set  used  in  this  study  are  the  final 
analyses  of  the  500  and  1000  mb  geopotential  height  for  the  Northern 
Hemisphere  made  by  NMC.  These  data  were  obtained  from  the  archives 
of  the  National  Center  for  Atmospheric  Research  (NCAR) .  There  are 
two  maps  per  day,  00  and  12Z,  over  six  winter  seasons  beginning 
with  1973-74  and  ending  with  the  1978-79  winter  season.  A  winter 
season  is  defined  as  December,  January,  and  February.  The  fore¬ 
cast  data  set  is  comprised  of  24-,  48-,  and  72-hour  500  and  1000  mb 
forecasts  of  geopotential  height  over  the  same  period  and  tines  as 
the  observational  data.  The  forecasts  were  made  by  NMC's  six- 
layer  PE  developed  by  Shuman  and  Hovermale.  A  seven- layer  PE  was 
tested  and  became  operational  in  January  1978.  The  forecast  data 
were  also  obtained  from  NCAR.  All  statistics  were  calculated  on 
the  NMC  octagonal  grid  containing  1977  grid  points  and  results 
were  mapped  using  the  NCAR  graphics  software  without  any  smoothing 
or  Interpolation  to  latitude/longitude  grid  points.  The  study  is 
based  on  approximately  1030  24-  and  48-hour  500  mb  forecasts,  each 
of  which  are  equally  divided  between  the  00  and  12Z  initial  times. 
There  are  about  half  as  many,  approximately  550,  72-hour  fore¬ 
casts  which  were  made  mostly  from  00Z  initial  time.  There  were  48 
missing  grids  almost  equally  divided  amont  the  00  and  12Z,  500  and 
1000  mb  categories. 

3.  Preliminary  Comparison  of  Forecast  Fields  to  Climatology 

In  order  to  establish  a  climatology  for  this  study,  the  mean 
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aid  standard  deviation  of  the  observations  were  computed  for  both 
the  500  and  1000  mb  levels.  These  maps  are  presented  in  Figs.  3 
and  4.  Comparison  of  the  mean  height  of  the  500  and  1000  mb  le¬ 
vels  for  the  six  winter  seasons  in  this  study  (Fig.  3)  to  clima¬ 
tology  produced  by  other  researchers  [for  instance,  Blackmon  (1976) 
Fig.  3  and  Blackmon,  et  al.  (1977),  Fig.  1]  shows  good  agreement 
in  both  position  and  amplitude  of  major  standing  wave  troughs, 
ridges,  highs,  and  lows.  A  similar  comparison  of  the  standard 
deviation  fields  for  the  two  levels  produced  from  this  data  set 
(Fig.  4)  also  shows  good  agreement  in  the  position  and  amplitude  of 
the  maxima  over  the  northern  Atlantic  and  Pacific  oceans  but  dif¬ 
fers  with  respect  to  both  position  and  amplitude  of  the  maximum 
over  central  Asia. 

As  a  preliminary  comparison  between  forecast  and  observational 
climatologies,  the  mean  and  standard  deviation  for  both  the  500 
and  1000  mb  72-hour  forecasts  are  presented  in  Figs.  5  and  6,  re¬ 
spectively.  Comparison  of  Fig.  5a  with  the  mean  500  ofo  observa¬ 
tional  climatology  (Fig.  3a)  shows  good  agreement  with  respect  to 
the  position  of  the  mean  troughs  and  ridges,  but  the  mean  500  mb 
forecast  height  field  tends  to  be  higher  in  the  polar  regions  and 
lower  in  the  subtropics.  At  1000  mb,  the  mean  forecast  climatology 
(Fig.  5b)  is  characterized  by  weaker  lows  in  the  northern  Atlantic 
and  Pacific  oceans  and  a  weaker  Siberian  high  than  the  observed 
1000  mb  height  field  (Fig.  3b).  The  standard  deviation  of  the 
500  mb  forecast  field  (Fig.  6a)  appears  quite  similar  to  the 


Mean  observed  geopotential  height  at  (a)  500mb  and  (b)  lOOOmb  based  on  six  winter  seasons 
contour  interval  for  (a)  60m  and  for  (b)  20m. 


Fig.  4.  Standard  deviation  of  the  observed  geopotential  height  for  (a)  500mb  and  (b)  lOOOmb  based  on 
dally  data  for  winter;  contour  Interval  10m. 
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climatological  one  but  slightly  smaller  values  of  the  magnitude 
are  observed.  The  standard  deviation  of  the  1000  mb  forecast  (Fig. 
6b)  appears  similar  to  that  of  the  observation  except  that  in  the 
Tibetan  plateau  region,  the  model  produces  a  variance  approximately 
twice  as  large  as  the  observed.  Much  if  not  all  of  the  excess 
variance  in  this  region  may  be  associated  with  the  problems  of 
generating  a  model  sea  level  pressure  field  in  a  region  of  high 
terrain.  In  summary,  the  forecast  fields  are  characterized  by 
weaker  mean  flows  than  the  observed  at  both  levels  and  they  exhibit 
slightly  less  variability  about  the  time  mean.  A  more  detailed 
discussion  of  the  differences  and  determination  of  the  systematic 
bias  of  the  forecasts  will  be  presented  in  Section  6. 

4.  Statistical  Methods 

A  summary  and  discussion  of  currently  used  verification  tech¬ 
niques  and  the  organizations  which  actively  participate  in  model 
verification  programs  can  be  found  in  Weather  Forecasting  and  Wea¬ 
ther  Forecasts;  Models,  Systems,  and  Users,  Summer  Colloquium, 
1976,  NCAR,  Boulder,  Colorado.  The  most  common  objective  veri¬ 
fication  statistics  used  by  operational  forecast  centers  (such 
as  the  U.  S.  Air  Force  Global  Weather  Center  and  NMC)  are  the 
mean  algebraic  error,  the  mean  absolute  error,  the  standard  devi¬ 
ation  of  the  error,  NMC’s  SI  score,  the  root-mean-square-error, 


and  the  correlation  coefficient  between  the  forecast  and  ob¬ 
servation.  These  are  the  so-called  "single  valued"  statistics. 
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Frequently,  more  sophisticated  objective  verification  statistics 
and  techniques  are  used  but  primarily  on  research  models.  Since 
the  primary  goal  of  this  study  is  to  determine  the  geographical 
distribution  of  the  bias  and  the  non-systematic  errors,  only  the 
mean  algebraic  error  or  bias,  and  the  standard  deviation  of  the 
error  were  computed. 

The  mathematical  definitions  of  the  bias,  the  standard  devi¬ 
ation  of  the  error,  and  the  root-mean  square  error  are: 


Forecast  Error 


Bias 


E.  =  F. 
1  1 


0 


i 


(E)  =  Z  (E .  )  /N 
i=l 


Standard  Deviation 
of  the  Error 


(e’2)1'*  -  [I  (e  -  e )2mh 

i=l 


(1) 

(2) 

(3) 


Root-Mean-Square  '  2^  =  y  (E2)/N]4 

Error  ( r.m.s ,)  v  i 


(4) 


where  E  is  the  forecast,  0  is  the  observation,  and  (E)  is  the  tem¬ 
poral  mean.  The  summations  are  taken  over  the  six  winter  seasons. 

—  ,2  \ 

The  relationship  between  (E)  and  (E*  )  can  be  more  clearly 
understood  by  examination  of  Fig.  7  which  shows  the  statistics  for 
a  hypothetical  grid  point.  The  model  shows  some  skill  forecasting 
the  non-systematic  part  of  the  error  as  evidence  by  the  fact  that 
(E ’  ) 4  <  (O'  )  a.  Some  small  fraction  of  the  r.m.s.  error  for  this 


hypothetical  grid  point  is  due  to  the  positive  bias  of  the  forecast 
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500mb  Geopotential  Height  (meters) 
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relative  to  the  observation. 

A  measure  of  skill  of  a  model  is  (E'4-)  2  or  (E^)  2  normalized 
by  the  climatological  standard  deviation  of  the  variable  in  ques¬ 
tion.  The  first  ratio  is  a  measure  of  the  non-systematic  error 
of  the  model  while  the  second  ratio  includes  both  the  systema¬ 
tic  and  non-systematic  components  of  the  error.  In  either  case, 
the  lower  the  ratio,  the  more  skillful  the  model.  For  a  model 

—  —  2  2 

with  no  skill  for  which  F  =  0  and  F'  =  O'  ,  it  can  be  shown  that 
E2  =  2  0 ' 2  . 


A  qualitative  graphical  representation  of  the  growth  of  the 
error  as  measured  by  the  r.m.s.  error,  the  theoretical  limit  of 
error  growth  in  a  perfect  model,  and  the  range  of  error  growth 
of  the  current  state  of  the  art  in  NWP  is  shown  in  Fig.  8.  Here 
the  growth  of  the  error  in  a  perfect  model  is  due  only  to  the  in¬ 
herently  limited  predictability  of  the  atmosphere  and  not  to  any 
error  in  the  initial  conditions  or  deficiencies  in  the  model's 
representation  of  dynamical  or  physical  processes.  Many  studies 
of  forecast  skill  use  this  type  of  graph  as  one  tool  for  compari¬ 
son. 

Another  measure  of  skill  is  the  correlation  coefficient  be¬ 
tween  the  forecast  and  the  observation.  The  correlation  coeffi¬ 
cient  between  these  two  variables  can  be  computed  using  equation 
(5). 


500  MB  HEIGHT  ERROR 
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Fig.  8 


.  Qualitative  comparison  of  "perfect  model"  forecast  skill 
with  that  of  persistence,  climatology,  and  present  models, 
as  measured  by  the  r.m.B.  500mb  height  error  (Leith, 1976) . 
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r(F,0> 


n2^  /’n'2\J5 


(O'2)*5 


(5) 


The  second  form  of  the  equation  is  convenient  for  determining 
r(F,0)  because  only  the  standard  deviation  of  the  forecast  is  re¬ 
quired  in  addition  to  the  already  computed  standard  deviation  of 
the  forecast  error  and  the  climatological  standard  deviation. 

By  breaking  up  the  total  error  into  systematic  and  non-sys- 
tematic  parts,  the  percent  of  the  total  variance  due  to  the  bias 
can  be  determined. 


%  Variance  _  E^  _  E2 
due  to  bias  — 5-  — -  — 

E  1  +  E’ 


(6) 


This  statistic  gives  an  indication  pf  the  seriousness  of  the  error 
associated  with  the  bias.  Where  the  bias  contributes  a  signifi¬ 
cant  part  of  the  error,  it  might  be  advisable  to  correct  the  fore¬ 
cast  for  the  bias  at  those  giid  points  most  seriously  affected. 

Additional  information  about  the  model  characteristics  can 
be  gained  by  comparing  the  error  statistics  for  the  500  and 
1000  mb  levels .  The  correlation  between  the  error  at  the  2  levels 
and  the  ratio  of  the  standard  deviation  of  the  errors  at  the  two 
levels  are  two  statistics  which  will  give  an  indication  of  the 
vertical  structure  of  the  error  field. 

A  comparison  of  the  normalized  standard  deviation  of  the  er- 
2  2  h  2  2  h 

tots,  (E'  /O'  or  (E  /O'  )  ,  between  the  two  levels  gives  an 
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Indication  of  whether  the  model  Is  more  skillful  at  500  mb  level 
than  at  the  surface.  Still  other  insights  into  the  model  charac¬ 
teristics  can  be  gained  by  analyzing  the  time  series  of  the  fore¬ 
cast,  observation,  and  forecast  error  at  individual  grid  points. 
For  example,  the  cross  correlation  of  the  forecast  and  observed 
time  series  gives  information  concerning  systematic  phase  errors 
in  the  tests.  All  of  these  statistics  and  comparisons  were  per¬ 
formed  and  will  be  discussed  in  section  6. 

An  important  source  of  uncertainty  in  this  study  is  the  as¬ 
sumption  that  the  final  NMC  objective  analysis  perfectly  matches 
the  actual  state  of  the  atmosphere.  It  has  been  shown  by  Oort 
(1978)  in  his  study  of  the  adequacy  of  the  rawinsonde  network, 
that  such  analyses  may  contain  significant  errors.  On  the  basis 
of  experiments  with  a  simulated  network  of  radiosonde  stations 
embedded  in  a  General  Circulation  Model  grid,  he  estimated  that 
the  root-mean-square  error  averaged  over  the  month  of  January 
using  daily  values  is  on  the  order  of  14m.  It  will  be  shown 
in  the  next  section  that  some  of  the  largest  biases  occur  in  re¬ 
gions  of  relatively  dense  radiosonde  observations. 

5.  Investigation  of  Effects  of  Different  Analysis  Procedures 

During  the  period  of  study,  three  different  objective  analy¬ 
sis  schemes  were  operational  at  NMC:  the  Cressman  routine  (Cress- 
man,  1959)  which  was  operational  until  September  1974;  the  Flat¬ 
tery-Hough  routine  (NWS,  Tech.  Proc.  Bull.,  1974)  which  was  used 
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from  that  time  through  September  1978,  and  the  present  routine, 
the  Optimum  Interpolation  method  (McPherson,  1979),  was  opera¬ 
tional  during  the  last  winter  season  in  this  data  set.  To  test  the 
influence  of  the  different  analysis  schemes  on  the  statistics  pre¬ 
sented  here,  the  mean  error  of  the  500  mb  72-hour  forecast  was 
computed  for  the  periods  in  which  each  of  the  three  analysis  rou¬ 
tines  were  in  use.  The  results  for  the  periods  during  which  the 
Flattery-Hough  and  the  Optimum  Interpolation  routines  were  used 
are  shown  in  Fig.  9.  If  one  bears  in  mind  that  the  Flattery- 
Hough  statistics  are  based  on  four  winter  seasons  of  forecasts 
and  the  Optimum  Interpolation  statistics  are  based  on  only  one, 
the  patterns  are  perhaps  as  similar  as  one  can  expect.  Thus  it 
is  concluded  that  the  objective  analysis  schemes  had  little  ef¬ 
fect  on  the  systematic  errors  of  the  model. 

6.  Error  Statistics 

a.  Systematic  Errors 

Shown  in  Fig.  10  are  mean  error  patterns  of  the  PE  at  the  500  mb 
level.  At  24  hours,  a  negative  bias  is  present  over  low  latitude 
oceans  and  a  generally  positive  bias  poleward  of  408N.  By  72  hours, 
there  is  a  stronger  longitudinal  dependence  of  the  bias  with  posi¬ 
tive  bias  in  the  mean  troughs  and  over  the  polar  region,  and  nega¬ 
tive  bias  in  the  mean  ridges  and  low  latitudes.  The  bias  over 
the  pole  increases  by  about  20  meters  per  24  hours,  which  is  typi¬ 
cal  of  the  largest  errors  at  this  level. 


Fig.  9.  Mean  error  for  the  500mb  72-hour  forecast  for  the  period  in  which  the  (a)  Flattery-Hough  and 
(b)  Optimum  Interpolation  objective  analysis  schemes  were  operational;  contour  interval  10m. 


Fig.  10c. 


As  in  Fig.  10a  except  for  72-hour  forecast.  Dots 
indicate  grid  points  at  which  cross  spectrum  analysis 
was  performed. 
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Shown  in  Fig.  11  are  the  corresponding  mean  error  patterns  at 
the  1000  mb  level.  In  contrast  to  the  500  mb  height  error,  the 
1000  mb  height  errors  do  not  show  as  much  of  a  latitudinal  gra¬ 
dient  of  the  bias.  The  bias  pattern  is  generally  oriented  toward 
negative  bias  over  continents ,  and  slightly  negative  bias  over 
the  low  latitude  oceans.  By  72  hours,  centers  of  positive  and 
negative  bias  closely  match  the  positions  of  the  climatological 
mean  wintertime  lows  and  highs. 

The  1000  mb  bias  fields  (Fig.  11)  reveals  sene  features 
which  may  be  associated  with  major  mountain  ranges.  There  tends 
to  be  a  west-to-east ,  positive-to-negative  bias  across  the  moun¬ 
tain  ranges.  For  example,  the  24  hour  forecast  error  field  shows 
a  center  of  negative  bias  located  in  the  Gulf  of  Mexico.  At  48 
hours  it  begins  to  expand  north  along  the  east  slope  of  the  Rocky 
Mountains,  and  becomes  more  negative  at  72  hours.  At  the  same 
time,  a  ridge  of  positive  bias  develops  over  and  to  the  west  of 
the  range.  Other  mountain  ranges  where  this  cross  range  bias  is 
seen  are  over  the  Himalayas  and  the  Alps . 

The  thickness  error  was  computed  from  the  mean  500  and  1000  mb 
error  fields  using  the  relation, 


(F-0) 


500 


(F-0) 


1000 


(F  -F  ) 
v  500  1000; 


^SOO^IOOO* 


(7) 


Fig.  12  shows  the  thickness  error  pattern  for  the  72-hour  forecast. 
It  is  particularly  obvious  that  the  mean  500  mb  troughs  are  too 
warm  and  the  mean  ridges  are  too  cold. 
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Fig.  11a.  Mean  error  for  the  lOOOmb  24-hour  forecast;  contour 
interval  10m 
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It  is  evident  from  these  results  that  the  model  systematically 
damps  out  the  climatological  mean  surface  highs  and  lows,  and 
500  mb  troughs  and  ridges.  This  behavior  has  been  detected  pre¬ 
viously  by  other  researchers.  Fawcett  (1969)  in  his  study  of  the 
systematic  error  in  the  36-hour  500  mb  PE  model  forecast  (1964- 
1968)  detected  the  tendency  for  the  model  to  flatten  out  the  fields 
not  only  in  mean  monthly  charts,  but  also  in  daily  error  charts. 
Baumhefner  (1978)  noted  the  growth  in  the  positive  bias  at  higher 
latitudes  as  the  forecast  period  increased  which  is  a  tendency  con¬ 
sistent  with  the  growth  of  the  positive  bias  over  the  pole  ob¬ 
tained  in  the  present  study.  It  is  interesting  to  note  that  sys¬ 
tematic  damping  of  the  amplitude  of  the  atmospheric  fields  also 
has  been  observed  to  occur  in  individual  disturbances.  For  exam¬ 
ple,  Baumhefner  (1978)  also  noted  that  the  PE  underforecasted  the 
amplitude  of  the  large  scale  waves  on  a  day-to-day  basis  and  at 
ail  forecast  times. 

b.  Non-Systematic  Errors 

2  2 

Shown  in  Fig.  13  is  the  ratio,  (F'  /O’  )  Over  most  of  the 
mid-latitudes  poleward  of  30°S,  this  ratio  is  close  to  unity.  Even 
though  it  was  demonstrated  that  the  PE  model  shows  a  tendency  to 
systematically  underforecast  climatological  features  as  well  as 
individual  disturbances,  the  closeness  of  this  ratio  to  one  in¬ 
dicates  that  the  PE  model  is  able  to  simulate  real  atmospheric 
fluctuations  more  accurately  than  expected. 

Shown  in  Fig.  14  and  15  are  the  patterns  of  standard  deviation 


s: 


Ratio  of  the  standard  deviation  of  the  500mb  72-hour 
forecast  to  the  500mb  climatological  standard  deviation 
of  the  observation;  contour  interval  0.1. 
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of  the  error  at  the  500  and  1000  mb  levels  respectively.  At  both 
levels,  it  can  be  seen  that  there  is  a  monotonic  increase  in  the 
non-systematic  error  over  the  forecast  period  at  almost  all  grid 
points.  Also,  the  shape  of  the  standard  deviation  patterns  clear¬ 
ly  resemble  the  climatological  standard  deviation  as  can  be  seen 
by  comparison  with  Fig.  4.  They  even  more  strongly  resemble  the 
bandpass  filtered  patterns  shown  in  Fig.  5d,  [Blackmon  (1976)], 
and  Fig.  2b,  [Blackmon,  et  al.  (1977)].  In  regions  with  large 
day-to-day  variability,  such  as  the  mid-latitude  storm  tracks, 
the  standard  deviation  of  the  error  is  relatively  large  while  in 
regions  with  small  day-to-day  variability,  such  as  the  tropics, 
the  standard  deviation  of  the  error  is  relatively  small. 

Shown  in  Fig.  16  is  geographical  distribution  of  the  normal- 
ized  standard  deviation  of  the  error,  (E*  /O'  )2  for  both  the  500 
and  1000  mb  72-hour  forecasts.  Note  that  the  normalized  fields 
are  relatively  featureless  except  for  the  monotonic  increase  as 
one  approaches  the  equatorward  edge  of  the  grid. 

The  correlation  coefficient  between  the  500  mb  72-hour  fore¬ 
cast  and  the  verifying  observation,  as  computed  from  Eq.  5,  was 
used  as  another  measure  of  forecast  skill.  Contours  of  correla¬ 
tion  coefficients  plotted  at  each  grid  point  are  shown  in  Fig.  17. 
Comparison  to  the  normalized  standard  deviation  field  (Fig.  16a) 
shows  the  same  general  shape  with  duplication  of  all  major  fea¬ 
tures.  Note  the  same  monotonic  pole-to-equator  decrease  in  skill 


of  the  forecast. 


Fig,  14a.  Standard  deviation  of  the  500mb  24-hour  forecast  error; 
contour  interval  10m. 


Normalized  standard  deviation  of  the  error  for  the  (a)  500mb  and  (b)  lOOOmb  72-hour 
forecasts;  contour  Interval  0.1. 


Fig,  17.  Correlation  coefficients  between  the  forecast  and  observa¬ 
tion  for  the  500mb  72-hour  forecast;  contour  interval  0.1 
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In  order  to  gain  an  appreciation  of  the  growth  of  the  error 
at  both  the  500  and  1000  mb  levels,  graphs  of  the  area-averaged 
standard  deviation  of  the  error,  non-systematic  error,  and  total 
error  are  plotted  versus  forecast  time  in  Fig.  18.  The  area-aver¬ 
aged  c 1 imatological  standard  deviation  derived  from  this  data  set 
is  plotted  as  a  horizontal  line.  At  both  levels,  the  PE  shows 
skill  relative  to  climatology  out  to  at  least  72  hours.  The  500  mb 
errors  are  larger  in  an  absolute  sense,  but  when  normalized  by 
dividing  through  by  the  climatological  standard  deviation,  they  are 
smaller  than  the  corresponding  1000  mb  errors. 

The  percent  of  the  total  variance  of  the  error  due  to  the 

~~2  ~~2  2 

bias,  E  /(E  +  E'  ),  was  calculated  for  the  500  mb  72-hour  fore¬ 

cast  and  is  plotted  in  Fig.  19.  Most  of  the  error  is  due  to  the 
non-systematic  part  of  the  total  error  as  indicated  by  the  low 
values,  but  the  bias  contributes  a  significant  part  of  the  total 
error  over  the  pole,  at  low  latitudes,  and  in  the  mean  troughs. 

c .  Vertical  Structure  of  the  Error 

Various  statistics  were  calculated  and  compared  between  the 
500  and  1000  mb  levels  in  order  to  determine  the  vertical  struc¬ 
ture  of  the  error  field.  The  ratio  of  the  500  to  1000  mb  standard 

2  2  h 

deviation  of  the  error,  (E'  ,  tor  t*ie  ?2-hour  f°re_ 

cast  is  plotted  in  Fig.  20.  Comparing  this  to  Fig.  5a  of  Blackmon, 
et  al .  (1979)  which  shows  the  ratio  of  the  500  to  1000  mb  clima¬ 
tological  standard  deviations,  one  notes  the  similarity  in  the 
shape  of  the  two  fields.  This  similarity  implies  a  close  associa- 


Fig.  19.  Percent  of  total  variance  of  the  error  due  to  the  bias 
for  the  500mb  72-hour  forecast;  contour  interval  0.1. 
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tion  between  the  vertical  structure  of  the  error  and  that  of  cli¬ 
matology. 

The  correlation  coefficient  between  the  500  and  1000  mb  error 
for  the  24-  and  72-hour  forecast  was  computed  for  each  grid  point 
over  the  six  winter  seasons.  Results  are  shown  in  Fig.  21.  The 
72-hour  error  correlation  (Fig.  21a)  shows  a  remarkable  resemblance 
to  the  correlation  between  the  500  and  1000  mb  observed  height 
fluctuations  [Blackmon,  et  al.  (1979),  Fig.  3a]  with  respect  to  the 
shape  of  the  fields  with  only  slight  differences  in  the  magnitudes 
of  the  maxima  and  minima  centers.  The  significance  of  this  strik¬ 
ing  similarity  is  that  the  vertical  structure  of  the  error  field 
is  not  random,  but  it  is  much  like  that  of  real  atmospheric  fluc¬ 
tuations.  The  correlation  map  for  the  24-hour  forecast  error  is 
shown  in  Fig.  21b.  This  distribution  shows  some  resemblance  to 
the  observed  fields  but  to  a  much  lesser  extent.  The  regional 
contrasts  are  not  as  large  as  those  for  the  72-hour  forecast 
errors. 

The  ratio  of  the  500  to  1000  mb  normalized  standard  deviation 
of  the  error,  (E'  2/0'  2/°' 2)  1000’  gives  a  comparison  of 

the  model  performance  at  the  two  levels.  It  is  widely  believed 
that  forecasts  are  more  skillful  at  500  mb.  The  area-averaged 
statistics  (Fig.  18)  seem  to  support  this  idea.  Values  of  the 


above  ratio,  averaged  over  the  NMC  grid,  are  .79  for  the  24-hour 
forecasts,  .80  for  the  48-hour  forecasts,  and  .89  for  the  72-hour 
forecasts.  But  these  ratios  are  not  uniform  over  the  entire  grid. 


Fig.  21.  Correlation  coefficients  between  the  500  and  lOOOmb  mean  errors  for  the  (a)  72>!-hour  and 
(b)  24-hour  forecast;  contour  interval  0.1. 
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In  fact,  there  are  some  regions  where  the  model  performs  better 
at  1000  mb  than  at  500  mb.  Fig.  22  shows  the  geographical  distri¬ 
bution  of  this  ratio  for  the  72-hour  forecasts.  Over  the  eastern 
U.S.  and  eastern  Asia,  the  model  performs  considerably  better  at 
the  mid-tropospheric  levels  than  at  the  ground.  However,  over 
the  eastern  Atlantic  and  Europe,  performance  at  the  two  levels 
tends  to  be  roughly  comparable. 

d.  Systematic  Phase  Errors 

In  previous  sections  it  was  shown  that  the  PE  model  is  char¬ 
acterized  by  systematic  forecast  errors  which  tend  to  damp  out  the 
mean  zonal  flow  and  the  forced  planetary  waves.  These  forecast 
errors  lead  to  a  progressive  weakening  of  the  planetary  scale  wind 
field  throughout  the  forecast  interval.  Taking  the  gradient  of 
the  height  error  from  the  center  to  the  southern  edge  of  the  po¬ 
sitive  bias  regions  off  the  eastern  coasts  of  Northern  American 
and  Asian  continents  (Fig.  10c),  the  error  in  the  geostrophic  wind 
was  calculated  to  be  3.5  and  7.5  m/s  (or  6.3%  and  8.3%,  respective¬ 
ly)  slower  than  the  observed  wind. 

It  is  of  interest  to  determine  whether  this  erroneous  weak¬ 
ening  of  the  planetary  scale  flow  with  time  contributes  to  the  non- 

,  ,  2  k 

systematic  component  of  the  error,  (E  )  ,  by  virtue  of  its  effect 
in  the  phase  speeds  of  transient  weather  systems.  One  would  expect 
the  resulting  underestimate  of  the  phase  speeds  to  be  most  serious 
in  the  jetstream  regions  where  the  systematic  underestimate  of  the 
zonal  wind  is  larger  and  the  transient  variability  is  also  large. 
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Ratio  of  the  500  to  lOOOmb  normalized  standard  deviation 
of  the  error  for  the  72-hour  forecast;  contour  interval  0.1 
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In  order  to  determine  the  seriousness  of  underestimated  phase 
speeds  as  a  potential  source  of  error  in  the  forecast,  the  cross 
spectra  was  computed  between  the  72-hour  forecasts  and  observations 
at  the  nine  points  indicated  by  the  dots  in  Fig.  10c,  which  include 
two  jetstream  locations.  Cross  spectra  were  computed  for  each 
winter  season  individually  and  then  averaged  over  the  six  seasons. 
The  time  series  used  were  the  500  mb  72-hour  forecast  and  verify¬ 
ing  observations  (00Z  data  only) .  Each  season  contained  ninety 
data  points  spaced  24  hours  apart.  In  order  to  obtain  reliable 
spectral  estimates,  a  lag  of  5  days  was  used  giving  18  degrees  of 
freedom.  The  results  of  the  cross  correlation  for  four  of  the  nine 
grid  points  are  shown  in  Table  1.  In  all  spectral  bands  for  all 
nine  grid  points  including  the  results  of  regions  not  shown,  there 
were  negative  phase  errors.  But  because  the  squared  coherence  is 
low  at  the  higher  frequencies,  the  confidence  in  these  phase  es¬ 
timates  is  marginal.  Using  tables  to  determine  the  reliability 
of  the  spectral  estimates,  the  squared  coherence  must  be  greater 
than  .239(.162)  for  the  99%(95%)  confidence  level.  So,  at  most 
of  the  grid  points,  confidence  can  be  given  to  phase  angles  through 
the  first  three  or  four  phase  estimates.  In  these  lowest  fre¬ 
quency  spectral  bands,  phase  lags  are  generally  between  -0.01  to 
-0.11  cycles.  With  such  small  phase  lags  along  with  the  fact  that 
the  forecasted  mean  zonal  wind  was  only  6  to  8%  too  slow,  it  can 

be  concluded  that  the  systematic  phase  lage  contributes  only  a 
2  4 

minor  part  to  (E*  )  . 


2163.1  2632.7  998.3  -  783.3 
1485.7  1396.5  62.8  -  417.2 
1383.6  1121.2  -248.8  -  148.8 
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7 .  Discussion 

The  biases  found  in  this  study  at  the  500  mb  level  were  quite 
small  and  insignificant  over  most  of  the  grid  when  compared  to  the 
r.m.s.  error.  It  is  only  around  the  periphery  of  the  grid,  over 
the  pole,  and  in  the  standing  wave  troughs  that  the  bias  contributes 
a  significant  part  of  the  error.  Empirical  corrections  after  each 
forecast  would  eliminate  the  bias  but  would  probably  not  be  noticed 
in  the  model  simulation  of  atmospheric  disturbances.  Improvement 
in  the  r.m.s.  error  statistic  would  be  slight.  The  important  as¬ 
pect  of  this  finding  is  that  the  PE  model  flattens  out  the  cli¬ 
matological  mean  fields  at  both  the  surface  and  mid-tropospheric 
levels . 

The  day-to-day  variability  of  the  error  fields  is  closely 
associated  with  the  day-to-day  variability  of  the  observed  field. 
This  resemblance,  as  seen  by  comparison  of  the  standard  deviation 
of  the  error  to  the  climatological  standard  deviation,  is  present 
throughout  the  forecast  period.  The  rate  of  growth  of  the  non- 
systematic  error  and  the  amplitude  of  the  error  fluctuations  are 
largest  over  the  northern  oceanic  storm  track  regions  while  they 
are  smallest  over  the  low  latitude  tropics.  However,  when  the  am¬ 
plitudes  of  the  non-systematic  error  fluctuations  are  compared  to 
the  climatological  variability,  the  degradation  in  the  forecast, 
as  manifested  by  the  pole-to-equator  increase  of  the  normalized 
error  fields,  is  greatest  approaching  the  equatorward  edge  of 


the  grid. 
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The  vertical  structure  of  the  fluctuations  in  the  error  field 
closely  resemble  the  vertical  structure  of  real  atmospheric  fluctua 
tions.  The  500  to  1000  mb  errors  are  highly  correlated  in  geograph 
ical  locations  where  the  observed  500  and  1000  mb  surfaces  are 
highly  correlated;  the  errors  are  poorly  correlated  where  observed 
500  and  1000  mb  are  poorly  correlated. 
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